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Several theorems are proved that give necessary and sufficient conditions for A-li to 
be positive semidefinite Ilermitian. The conditions are in terms of the comparison of 
elementary symmetric functions of the characteristic roots of A-\-X and B-\-X as X varies 
over positive definite Ilermitian matrices. 



1. Introduction 

In a recent paper [4] 2 AT. Stone obtained the fol- 
lowing result in reproving certain theorems of 
Ehrenfeld [1] : if F and G are positive definite n-square 
Hermitian matrices, then 



d(I+AF)>d(I+AG) 



(1) 



for all positive definite n-square Ilermitian matrices A 
if and only if F—G is positive semidefinite Hermitian. 
Here d denotes determinant and henceforth yl>0 
(>0) will signify that A is positive (positive semi- 
definite) Hermitian. F>G (F>G) will mean F-G 
>0 (F—G>0). Note that since A>0 if and only if 
^4 _1 >0 the condition (1) is the same as saying 



d(A+F)>d(A+G) 



(2) 



for all ^L>0. It is in the form (2) that we investi- 
gate what happens when we replace d by some other 
elementary symmetric function of the characteristic 
roots. To be specific, suppose E r (x u . . . , x n ) 
denotes the r th elementary symmetric function of the 
indicated variables: 



E r (#i, - - - , x n )-- 



- £ n x u . 

l<h<...<ir<n t=l 



For a fixed r, l<r<n, let E r (A) denote S r (Xi(^4), 
. . . , K(A)) where \i(A), i=l, ... , n, are the 
characteristic roots of A. If the \ t (A) are real we 
will choose our notation so that \i(A)> . . . > 
X w (^4). Then the problem we pose is to find con- 
ditions on the characteristic roots of F and G such 
that 

E r (A+F) >E r (A+G) for all A>0. 

We have 

Theorem 1. Assume F>0, G>0, and 1 <r<n. 
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Tj 

E f -MF), . . ., \,+i(F))>E r -MG), 

p=0,...,r-l 

it follows tlvat 

E T {A + F) > E r (A + G) jar all A>0. 
In case G=7, (3) becomes 
E r . P (\„(F), ■ ■ -, \,+i(F))>(£2*y P =0, ■ 



which is both necessary and sufficient for 

E r (A + F) > E, (A + 1) for all A>0. 



K- P (G)), 
(3) 



(4) 



(5) 



(6) 



We remark that for r = n the second part of 
theorem 1 simply becomes: \ n (F)>l if and only if 
d(A+F)>d(A + I) for all A>0. Now in (1) 
multiply both sides by d(A~ l R*R) where R is a non- 
singular matrix satisfying R*GR=I, R*FR=K 
and we find that 

d(R*A~ l R+K) >d(R*A- l R+l). 

Now as A runs over all positive definite matrices so 
does R*A~ 1 R and from the above remark we con- 
clude that \ n (K)>l. The characteristic roots of K 
are just the characeristie roots of G~ l F and thus 



G- 1/2 FG~ 1/2 > L 



F>G, 



and the result in [4] follows. A somewhat stronger 
multiplicative analogue of theorem 1 is available. 

Theorem 2. Assume F>0, G>0, and l<r<n. 
Then 

E r (AF)>E T {AG) jor all A>0 (7) 
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if and only if 



nx i (/f- 1 G')<i. 



(8) 



In order to prove these results we use some results 
concerning compound matrices and Grassmann prod- 
ucts [2]. We give the coordinate definition of these 
items and list several of their properties. If #1, 
. . ., x r are vectors in the unitary space of w-tuples, 
#<=(#*i, • • •; x in ), 1 <r<n, then X\ a . . . a x r is the 

( Vtuple whose coordinates are the r-square sub- 
determinants of the rXn matrix (Xfj), i=l, . . ., r, 
j=l } . . ., n arranged in lexicographic order. If A 

is an n-square matrix, then C T (A) is the ( )-square 

matrix whose entries are the r-square subdetermi- 
nants of A arranged in doubly lexicographic order 
according to the row and column indices of A. That 
is, if l<2'i<C . . . <^i r <n and 1 <7YC . . . ^jV^?? 
are two increasing sets of r integers then the (z'i, 
. . ., i r ) } (ji, . . ., j r ) element of C r (A) is the deter- 
minant of the matrix A[i u . . ., i r \j u . . .,j r ] whose 
(s, t) entry is a isJr If i s =j s , s=l, • • ., r we denote 
the corresponding principal submatrix by A[i x , . . ., 
i J. If A>0 then C r (A) >0; C r (A)x 1 a ... a x r =Ax a 
... a Ax r ; if ( , ) is the usual unitary inner product in 
the space of m-tuples then (x t a . . . a x ry y x a . . . a y r ) 
=d{(Xi, yj)}, i,j=l, . . ., r. 

We remark that the condition (3) of theorem 1 
will not imply that F>G in the case r<^n. For even 
if G=I, we can easily construct F so that \ n (F)<il 
and yet (3) holds for j) = 0, . . ., r—\. However, 
there is a direct generalization of Stone's result to 
the compound matrix. 

Theorem 3. If F>0, C>0 and l<r<n, then 

C r (I+AF)>C r (I+AG) for all A>0 

if and only if F>G. (9) 

Once again, r=n is precisely the result in [4]. 

2. Proofs 

We prove theorem 1. Let A=UXU*, U unitary, 

J£==diag Ox, . . ., x n ). Then E r (A+F)-E r (A + G) 
= E T (U*AU+U*FU)-E r (U*AU+U*GU) = E r (X 
+ K)-E r (X+H) where K=U*FU>0, H=U*GU 
>0. Let 



<p(x h 



.,x n : U)=E r (X+K)-E r (X+H). 



Then 



<p(x lt . ,.,x n :U)= 2 {d[dmg(x av . . .,x« r ) 

l<ai< . . . <a r <n 

+K [a u . . .,a r ]]—d[disig (x av . . ., Xa) 
+H[a 1 ,...,a r ]]}. (10) 

We see from (10) that <p is a polynomial in xi, . . ., 
x n of degree r in which the nonconstant terms form 
a multilinear polynomial. Let 



<Pil,...i v w> • • •jXn'U)- 



d p v 



i>x t . 



dx t 



(xi, . . ., x n \ U) 



and 



Then 



<P h . 



=Vi 1 .:. i A0,...,0:U). 



<p(x h . ..,a?»:?7)=S S 



P = 1<fi<...<f p <7l 



• X ip <p tl 



(11) 



where the term corresponding to p = is just <^(0, 
. . .,0: U). Let w denote the increasing sequence 
ii < C " - . . <Cv Unless w is a subset of ai<Z . . . <Ca r 
the partial derivative satisfies 



d p 



dXi . . . dXj 



{^[diag^a!, • ••,X aT )-\-K[ai . . .Or]] 



(12) 



— d[diag (x av . . ., Xa r )+H\ 



OLu 



r]}}=0. 



If w is a subset of «!< . . .< a r let << . . .< ajf^be 
the ordered complementary set of w in ai< . . .< a f . 
In this case the derivative in (12) has the value 

d[diag (ay, • • ., ayO+X[«r, • • -, a?-,]] 

-d[diag (a?ar, • . .,«.^)+H[ar, . . .,<£.,]]. (13) 

Setting »!= . . . =a re =0 in (13) we have 

l<ai< . . . <C.a r <n 

-d(H[a?,...,a?- P ])}, (14) 

where X)' indicates that the summation is taken only 
over those a x <i . . .<C a T containing w= (ii<i . . .<C i p ) as 
a subset, Returning to (11) we set x ix =. .. =x t =/and 
all other ^=0. Then 0(0, . . ., t, . . ., t 9 . . . 0: U) = 

^0?! • • • ip +L p -i(t) where L p _ { is a polynomial of degree 
at most p — 1 in t. Since 0(^, . . ., x n \ U)>0 we 
conclude, by letting t increase, that (f>\ . . . t > 0. Con- 
versely, if (t>° h ... ip >0 and 0(0, . . ., 0, U)>0, then 
0(xi, . . ., x n :U)>0 for all non-negative x 1? . . ., x n . 
Let j\<C . . .<^j n - p denote the set complementary to 
ii^ . . ^i P in 1, . . ., n. Let e j: j=l, . . ., n, be 
the unit 7i-tuple with 1 in position j, elsewhere. 
We may rewrite (14) as 



-m'(C r - P (G)U A . . . A ft. 



^A 



. A U a 



(15) 



=S'{dK:[ai,..., ffr . 


.,]-fflk . . . 


a r-p\\ 


=z;m-*(#K 1 A. 


• • a ^,_„, «», a 


■ ■ A e °rJ 


-(C T ^(H)e^A . 


- • A ^ r _„, s A • 


■ ■ A «•_„)} 


= H'(C r - P (F)U ai A.. 


• A «»,_,,, M», A • 


• • A W %-„) 
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where XI' indicates that the summation extends over 
precisely those increasing sequence a { <i . . ,<i<r T _ p 
which are subsets of j x <i . . .<ij n ^ p . Also Uj= Ue h 
j=l, . . ., n is an orthonormal set of vectors (recall 
that U is unitary). We use an extremal result in 
[3: theorem 1, p. 525] to conclude from (15) that 



07, 



>S r -,(Xp+i(F), . . ., \ n (F)) 



-E r -MG),...,\ n -,(G))>0. 

In case 0=1 we have from (15) again that 
4>\...i v = JZ'{C r ^ p {F)u ai A ... a ^ r _ p , 

V ffl A . . . A U arp ) — (?!») 

for all choices of sets of n— p orthonormal vectors 
u u . . ., u n - P , and another application of the above 
cited extremal result completes the proof. 

To proceed to the proof of theorem 2, choose a 
nonsingular R such that F=RR*, G=RDR*, 
D=diag (MF-'G), . . ., \ n (F~ l G)). Let P be an 
arbitrary nonsingular matrix and since any A^>0 is 
of the "form (PR- 1 )* (PR' 1 ) we have that (7) is 
equivalent to 

E r ((PR~ l ) * (PR- 1 ) RR *) > /<; ( (PR- 1 ) * (PiT 1 ) /?/>/?*) , 

E r (P*P)>E r (P*PD), 

E r (A) > E r (AD) for all ,4>0. (16) 



or 



In (16) replace A by VXV*, V unitary, X=diag 

(xi, . . ., x n ) >0 to obtain 



or 



E r (X) > E r (XH) , 11= V*DV 
tr[C r (X)(I-C r (H))]=0, 



(17) 



where /is the ( r w )-square identity matrix. It is not 
difficult to check that (17) holds for all non-negative 
diagonal X if and only if every diagonal element of 
I—C r (II) is non-negative. That is, 

l-(G\(V*DV)e ai A . . . Ag v e ai A . . . a^)>() 

must hold for all V and all l<an< . . <^a r <n. But 
this is precisely equivalent to 

(C r (D)u a * . . *Ua r , u a * . . *u a )<\ 

for all orthonormal u„ v . . . , u ar . As in the proof 
of theorem 1 we have finally that (7) holds if and 
only if (8) does. 

To prove theorem 3 it will be convenient to let = 
denote the relation of Hermitian congruence. Then 
if ^4>0, 

C r (I+AF)-C r (I+AG)=C r (A) 

{C r (A- 1 +F)-C r (A- 1 +G)} = (C r (A))^ 

[C T (A- 1 +F)-C r (A- 1 +G)](C r (A))-^. 



This last matrix has the same roots as the Hermitian 
matrix C r (A- 1 + F)-C r (A~ 1 + G) and hence (9) is 
equivalent to 



Now 



C r (A+F)>(\(A+G) for all ,4>0. 



C r (A+F)-C r (A+G) = C r (A+F)-(C T (G)Y' 2 
C r (G- y2 AG- l/2 +I)(C r (G)y /2 M y r (G- 1/2 AG- 1/2 

+G- l,2 FG- xl2 )-( 1 r (G- l/2 AG- l,2 +I). 

Thus (9) is equivalent to 

C r (A+H)-C r (A+I)>0 lor all A>0 (18) 

where H=G~ m HG- 1/2 . 

By a unitary congruence we may assume H=Amg 
(hi, . . . , h n ) and by setting ^i=diag (x u ..., x n )>0 
we see that 

for any non-negative numbers x iv . . . , x t . This 
clearly implies that each h t >l, t=l, . . . , n. 
Thus, 0>H-I=G- 1/2 FG- 1/2 -I=F-G. 

Conversely suppose F—G>0. Then H>I and if 
we set B=A J rI^>0 we would like to conclude that 



(19) 



(20) 



C r (A+H) = C r (B+II-I)>C r (B) = C r (A+I). 
But (19) is equivalent to 

C r (I+K)>C r (I), K=B- 1I \H-I)B- 112 . 
After a unitary congruence (20) reduces to 
tf r (/+diag (Jfci, . . . , k n ))>C T (I), 



where k a >0 are the characteristic roots of K. The 
proof is complete. 
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